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Introduction 

RTE-A combined with VC+ and the A600+, A700, and 
A900 processors presents Hewlett-Packard's most ad- 
vanced family of real-time computer systems. Effective 
with revision 2440, RTE-A supports up to 48 I/O chan- 
nels and optional partitioning of drivers and certain 
operating system modules to gain more space for tables 
in large configurations. This brief provides new informa- 
tion and brings information in the previous RTE-A per- 
formance brief up to date, based on tests with the 2440 
version of RTE-A. It is important to note, however, that 
the numbers in this performance brief are not compar- 
able with those published in previous RTE-A perfor- 
mance briefs because of improvements in testing 
procedures. 

There are six sections in this performance brief: 

1. Overview. A look at the A-Series processors. 

2. File System. Comparison of programmatic perfor- 
mance of the new and old file systems. 

3. I/O Performance. Logic analyzer measurements, in- 
cluding comparison of I/O in the CPU and in the 
I/O Extender. 

4. CDS. (Code & Data Separation) Performance com- 
parisons of scheduling CDS and non-CDS programs. 

5. RTE-A Partitioning. Additional execution times 
incurred by partitioning RTE-A to gain table space. 

6. Whetstone Benchmarks. Benchmark times for the 
A600+, A700, and A900 processors. 

The information in this brief is based on logic analyzer 
measurements of particular calls or processing intervals. 
Variable factors affecting results are discussed. The num- 
bers are not absolute and will vary with configurations. 



Section 1 — A-Series Overview 

Base instruction set processing speeds of the A 6 00+ and 
A 7 00 processors are about the same, although execution 
time may be a bit faster on one computer or the other, 
depending upon the instruction mix of the program. 

The A 7 00 processor can optionally be equipped with a 
hardware floating point processor for floating point 
computation speeds that are more than triple the firm- 
ware based floating point speeds of the A600+. This 
performance difference is further reinforced by Scien- 
tific and Vector Instruction Set (SIS and VIS) firmware, 
which is much faster than the software equivalents in 
the A 6 00+. The A 7 00 also supports microprogramming 
of frequently -used software routines by the user for a 
3-to- 10-fold speed-up for optimizing performance. All 



A 7 00 data in this performance brief is taken from an 
A700 Computer with hardware floating point, SIS, and 
VIS. 

Standard hardware floating point and SIS and VIS firm- 
ware help make the A900 one of the best price/perfor- 
mance 16/ 3 2 -bit machines available today. Sixteen-bit 
external paths make the A900 a compatible member of 
the A-Series family, while 3 2 -bit internal paths give it 
the performance to compete against higher-priced wide- 
word machines. 

Pipelining in the A 9 00 CPU provides fast parallel pro- 
cessing of instructions, using a cache memory to provide 
fast access. The A 9 00 processor provides three times the 
speed of an A 7 00 in computation -intensive applications. 
With the A-Series I/O and the fast CPU, I/O intensive 
applications will normally be equal to or faster than the 
A600+ or A700. However, the A900 especially excels at 
integer or floating point "number crunching" for uses 
that involve loops or repetitions which utilize the cache 
memory to the fullest. 

Table 1 shows how the respective A-Series computers 
compare when executing various selected system services 
under RTE-A. The instruction set and execution times 
of the A-Series computers are listed in Table 4 of the 
HP 1000 A-Series Computer Design and Specifications 
data sheet of the HP 1000 A-Series Hardware Technical 
Data book, literature request number 5953-8761, or a 
later revision. 



Section 2 — File System 

Table 2 shows file system performance figures for the 
A600+, A700, and A900 processors using a 7912 disc. 
To differentiate between the file calls and hierarchical 
directory structure introduced with RTE-A revision 
2326 and the structure used in previous versions of RTE, 
and still supported, the new is referred to as FMP and 
the old as FMGR. Table 2 compares all possible permu- 
tations, including new (FMP) file calls to new (FMP) disc 
LUs, new (FMP) file calls to old (FMGR) disc LUs, and 
old (FMGR) file calls to old (FMGR) disc LUs. 

The create and open times in Table 2 are measured for a 
file being created or opened that is located in the same 
directory as the type 6 (program) file, as the first entry 
in that directory, which has no extents. These constraints 
minimize disc access time, while providing a consistent 
basis for measurement. Create and open times will other- 
wise be longer, varying with directory size and extents, 
file entry location in the directory, and with the need 
(if any) to access a different directory. 



Table 1. Logic Analyzer Measurements of Selected Sys- 
tem Services (execution times in microseconds) 



OPERATION 


A600+ 


A700 


A900 


Write 1 byte to terminal, 
unbuffered 


4,510 


4,700 


2,800 


Write 80 bytes to ter- 
minal, unbuffered" 1 


88,110 


88,360 


85,660 


Write 1 byte to HP-IB, 
unbuffered 


3,230 


3,430 


1,690 


Write 80 bytes to HP-IB, 
HP-IB, unbuffered 


3,430 


3,620 


1,790 


Schedule Real-Time 
program resident in 
main memory with wait 
( t ime f rom Exec9 to 
ent ry of son program) 


1,340 


1,420 


700 


Schedule Background pro- 
gram from Real-Time pro- 
gram without wait (time 
from ExeciO until return) 


927 


1,007 


517 


Class GET (1 word) 


700 


827 


354 


Class GET (80 words) 


800 


905 


410 


Go priv using SLIBR 


80 


105 


51 


Go unpriv using $LIBX 


156 


192 


76 


Go priv using GOPRV** 


93 


126 


47 


Go unpriv using UNPRV** 


45 


49 


18 


Dispatchlock 


169 


211 


80 


Dispatchunlock 


198 


242 


93 


Time Base Gen overhead 
with no timeouts 


190 


223 


79 


Time Base Gen overhead 
with one timeout entry 


199 


231 


82 



* Transfer of 78 bytes instead of 80 bytes saves 3 
to 4 milliseconds (3000-4000 microseconds) over 
these times because the device driver can send a 
78-byte record to the interface driver in one 
call, vs two required for 80 bytes. 

** The new GOPRV/UNPRV services execute about 40% 
faster than the old (and still supported) $LIBR/ 
SLIBX combination. 



Table 2 read and write times are based on having the 
Data Control Block (DCB) already in main memory to 
exclude disc access times and thereby assure consistent 
data. 

A 40-word record size was used because it is most typical 
of actual usage. Each word written or read accounts for 
about 1 microsecond of of the read/write times. 



The CREATE call for the new file system is slower due 
to more directory information. However, the READ and 
WRITE calls are about 30% faster, which can yield 
significant file I/O performance increases. A perfor- 
mance concern of the FMGR file system dealing with 
the size of the DCB and the file has been alleviated with 
the FMP file system. In the FMGR format, the files are 
pulled into the DCB in equal sized chunks. Consequently, 
the entire DCB is used only if the file size is an integer 
multiple of the DCB size. The FMP file system uses 
all of the DCB regardless of the size of the file, and 
brings in the remainder of the file on the last access, 
minimizing the number of disc accesses. 



Table 2. File System Performance Comparison (execution times in milliseconds) 





A600+ 


A700 


1 
A900 | 

1 


j Type of File Accessed « 
I Type of File Call Made * 


FMP 
FMP 


FMGR 
FMP 


FMGR 
FMGR 


FMP 
FMP 


FMGR 
FMP 


FMGR 
FMGR 


FMP 
FMP 


FMGR 
FMP 


1 
FMGR | 
FMGR | 


1 1. CREATE & OPEN FIRST FILE IN DIRECTORY, 
j Type * 2, Size ■ 2 Blks, Rec * 40 words 


139 


75 


45 


137 


76 


4. 


102 


56 


1 
42 | 

1 


I 2. READ 2nd RECORD IN FILE (DCB already in 
j memory) 


0.343 


0.344 


0.527 
J 


0.399 


0.393 


0.504 


0.145 


0.152 


0.223 | 

1 


j 3. WRITE A RECORD TO A FILE (DCB already 
1 in memory) 


0.340 


0.340 


! 0.516 


0.392 


0.388 


0.496 


0.142 


0.143 


0.185 I 


I 4. OPEN A FILE, Update = FMGR, RWOQ * FMP 


28 


64 


! 48 


31 


65 


1 48 


12 


45 


45 ! 



Section 3 — I/O Performance 

Table 3 provides detailed data on buffered, unbuffered, 
and class I/O within RTE-A. Interrupt response time 
(time from the occurrence of an interrupt to execution 
of the first instruction in the driver) is also specified 
in Table 3. Table 4 compares I/O execution times in the 
CPUs and I/O Extenders for four different A -Series in- 
terfaces. 



Table 3. Detailed Analysis of I/O Processing Overhead Using 64000 Logic Analyzer 



OPERATION 



l_ 



INITIATION (milliseconds) 



I CALL 

I EXEC 

I TO 

I DEV 

I DRVR 



| DEVICE 

| DRIVER 

I TO I/F 

I DRVR 
I 
I 

I (a&b) 

J 



CALL 
EXEC 
TO I/F 
DRIVER 



I/F | START | RETURN 



TO 



I DMA 



START | TO 
DMA | EXIT 
I SYST 



I 

I (a) | (c) 



I TO 
| CALLER 
I AFTER 
I INIT. 

I 



COMPLETION (milliseconds) 



I 
INTRPT | I/F 
RESP. | DRIVER 
TIME | TO DEV 
(Inter. | DRIVER 
to I/F | 
Driver) j 



"1 I 

| DEVICE | I/F 

I DRIVER | DRIVER 

I TO EXIT | TO EXIT 

I SYSTEM | SYSTEM 



(a) | (a & d) | 



RTE-A on A600+ 



r 

335 | 

351 | 

145 | 

163 | 

.145 | 

163 | 

145 | 

160 | 

145 | 

.163 | 



UNBUF. I/O TO TERMINAL | 

UNBUF. I/O TO HP-IB | 

BUFFERED I/O TO TERMINAL | 

BUFFERED I/O TO HP-IB | 

CLASS I/O TO TERMINAL(f) | 

CLASS I/O TO HP-IB(f ) | 
UNBUF CLASS I/O TO TER(f)| 

UNBUF CL I/O TO HP-IB(f) | 

CLASS RETHREAD TO TERMNL | 

CLASS RETHREAD TO HP-IB | 



0.900 

(e) 

1.020 

(e) 

1.222 

(e) 
1.284 

(e) 
1.135 

(e) 



1.029 

(e) 

0.863 

(e) 

0.870 

(e) 

1.051 

(e) 

0.867 

(e) 



1.330 
1.287 
1.495 
1.739 
1.407 



0.243 
0.445 
0.233 
0.435 
0.240 
0.444 
0.253 
0.428 
0.240 
0.443 



(c) 

(c) 

2.261 

1.888 

2.478 

2.101 

735 

322 

388 



2.014 



132 
132 
131 
130 
132 
131 
130 
131 
130 
131 



0.418 

(e) 

0.418 

(e) 

0.418 

(e) 

0.418 

0*417 
(e) 



0.730 

(e) 

0.514 

(e) 

0.552 

(e) 

0.578 

(e) 

0.648 

(e) 



I 



.1. 



RTE-A on A700 



I 0.951 | 

I I 

I 0.741 | 



I 



I 0.772 | 

I I 

I 0.797 | 

I I 

I 0.777 | 



J 



UNBUF. I/O TO TERMINAL | 0.975 
UNBUF. I/O TO HP-IB | (e) 
BUFFERED I/O TO TERMINAL | 1.155 
BUFFERED I/O TO HP-IB | (e) 
CLASS I/O TO TERMINAL(f) | 1.403 
CLASS I/O TO HP-IB(f) | (e) 
UNBUF CLASS I/O TO TER(f)| 1.452 
UNBUF CL I/O TO HP-IB(f) | (e) 
CLASS RETHREAD TO TERMNL | 1.300 
CLASS RETHREAD TO HP-IB | (e) 
I 



1.004 

(e) 

0.888 

(e) 

0.901 

(e) 

1.036 

(e) 

0.892 

(e) 



I I 

I 0.256 | 0.349 

1.369 | 0.477 | 0.368 

I 0.246 | 0.170 

1.433 | 0.467 | 0.192 

| 0.256 | 0.170 

1.687 | 0.474 | 0.192 

I 0.271 | 0.169 

1.873 | 0.460 | 0.187 

| 0.254 | 0.170 

1.580 | 0.474 | 0.191 
I I 



(c) 

(c) 

2.458 
086 
716 
353 
926 
524 
615 
245 



148 
148 
148 
148 
148 
148 
147 
148 
148 
147 



0.454 

(e) 

0.454 

(e) 

0.454 

(e) 

0.455 

(e) 

0.455 

(e) 



0.783 

(e) 

0.601 

(e) 

0.651 

(e) 

0.683 

(e) 

0.757 

(e) 



I 

I I 

I 1.044 | 



I 



RTE-A on A900 



0.862 | 



I 
I 

I I 

I 0.915 | 

I I 

| 0.944 | 



I 



I 0.914 | 



1 I 

I I 

I 0.501 | 

I I 

I 0.378 | 



UNBUF. I/O TO TERMINAL 
UNBUF. I/O TO HP-IB 
BUFFERED I/O TO TERMINAL 
BUFFERED I/O TO HP-IB 
CLASS I/O TO TERMINALS ) 
CLASS I/O TO HP-IB(f) 
UNBUF CLASS I/O TO TER(f) 
UNBUF CL I/O TO HP-IB(f) 
CLASS RETHREAD TO TERMNL 
CLASS RETHREAD TO HP-IB 




I 

0.703 | 

I 



0.754 | 

I 



127 | 0.179 | 

225 | 0.179 | 

' 0.092 | 

0.097 | 

0.092 | 



.126 | 

.225 | 

.131 | 

0.795 j 0.236 | 0.097 j 

I 0.137 | 0.091 | 

0.872 | 0.224 | 

0.132 



I 



0.094 
0.091 



0.675 | 0.234 | 0.099 | 



_l_ 



J. 




0.209 

0*228 

(e) 

0.227 

(e) 

0.227 

(e) 

0.228 

(e) 



0.381 

(e) 

0.289 

(e) 

0.322 

(e) 

0.339 

(e) 

0.363 

(e) 



I 



I 



| 0.415 | 

i 0.434 | 

I I 

I 0.399 | 



NOTES: 

a. Times in this column will increase if there are timeouts. 

b. When the I/O Extender is used, additional overhead is incurred for allocation of I/O Port Maps, as follows: 
33 microseconds in A600+, 37 microseconds in A700, 18 microseconds in A900. 

Return to caller after initiation is valid only for buffered calls. 

When the I/O Extender is used, additional overhead is incurred for deallocation of I/O Port Maps, as follows: 
30 microseconds in A600+, 35 microseconds in A700, 18 microseconds in A90. 

HP-IB operations were tested directly, without a device driver. 

Class I/O times do not include the time to allocate the class number. 



Table 4. CPU - I/O Extender I/O Performance Comparison (execution times in microseconds) 



INTERFACE 


OPERATION 


A600+ EXECUTION TIME 


A700 EXECUTION TIME 


A900 EXECUTION TIME 


IN CPU 


IN I/O 
EXTENDER 


IN CPU 


IN I/O 
EXTENDER 


IN CPU 


IN I/O 
EXTENDER 


12006A PARALLEL 
INTERFACE CARD 


1-CHARACTER READ 
80-CHARACTER READ 
1000-WORD READ 

1-CHARACTER WRITE 
80-CHARACTER WRITE 
1000-WORD WRITE 


360 

373 

1663 

359 

373 

1938 


362 

376 

1647 

363 

374 

2592 


379 

399 

1812 

381 

399 

2137 


386 

403 

1819 

385 

400 

2868 


187 

194 

1926 

187 

193 

2486 


190 

199 

1936 

189 

197 

3312 


12005B ASYNC 
SERIAL INTER- 
FACE CARD 


1-CHARACTER WRITE 

80-CHARACTER WRITE 

128-CHARACTER WRITE 


1051 

83300 

133300 


1058 

83340 

133300 


1060 

83320 

133300 


1066 

83320 

133300 


1060 

83320 

133300 


1067 

83350 

133300 


12009A HP-IB 
INTERFACE CARD 


1-CHARACTER WRITE 
80-CHARACTER WRITE 
1000-WORD WRITE 


362 

374 

1690 


374 

383 

2434 


388 

404 

1900 


398 

411 

2654 


189 

203 

2275 


191 

204 

3093 


12040C ASYNC 
MULTIPLEXER 
INTERFACE CARD 


1-CHARACTER WRITE 

80-CHARACTER WRITE 

128-CHARACTER WRITE 


1446 
1848 
2064 


1440 
1841 
2057 


1429 
1831 
2048 


1425 
1824 
2044 


1751 
2151 
2367 


1726 
2128 
2343 



NOTES: 

A. Times listed represent the time required for the actual transfer of data from the start of DMA until interrupt 
of RTE-A signalled completion. 

B. Additional overhead of I/O Extender on writes as compared to the CPU shows up mainly with transfers of larger 
blocks of data via fast interfaces, such as the 12006A Parallel Interface and the 12009A HP-IB interface. 
With slower interfaces, such as the 12005B Async Serial Interface and the 12040C Multiplexer Interface, the 
additional overhead of the I/O Extender shows very little effect upon overall throughput, regardless of the 
size of the block being transferred. 

C. Tests with the 12005B and 12040C interfaces bypassed the device driver to eliminate subdivision of data blocks 
into 78 character blocks by the device driver. 

D. The test data was the same in either a first I/O Extender or a second I/O Extender. 



Section 4 — CDS Programs 

This section deals with the performance of programs 
using the code and data separation feature of VC+. The 
test was done using one program (DOSR) to schedule 
another program (SR) which terminates, saving resources. 
It is already in memory when scheduled. The test was 
run with these permutations of three variables: 

1. Scheduling - DOSR schedules SR with wait (EXEC9) 

or without wait (EXEC 10). 

2. Priority - SR is always priority 99, and DOSR is 

higher (priority 90), or lower 
(priority 101). 

3. Execution - The test is done with both programs as 

non-CDS, both CDS, both CDS and SR 
shared, and both CDS and both shared. 



Table 5 reflects the difference in scheduling time due to 
priorities and type of scheduling call (with or without 
wait). Additionally, you can see, first hand, the addi- 
tional overhead incurred with cross-map calls when the 
code and data are in separate maps, and with additional 
table checking when the programs are shared. 



Table 5. CDS Scheduling Times (in microseconds) 



OPERATION 

1 


1 
1 

1 


DOSR EXEC9 to I 
SR entry 


DOSR EXEC9 to | 
return to DOSR j 

1 


DOSR EXEC10 to 
SR entry 


DOSR EXEC10 to 
return to DOSR 


1 

1 


1 1 
| DOSR PRIORITY > 

1 


90 

1 


1 
101 


90 | 

1 


1 
101 I 

1 


90 


101 


90 


101 


RTE-A/VO ON A600+ 


1 

j NON-CDS 

| BOTH CDS 





1348 
1575 
1585 
1585 


1345 
1575 
1585 
1586 


2474 ! 
2910 
2931 
2953 


1 
2471 | 
2921 I 
2931 
2956 

1 


1958 
2225 
2218 
2241 


1264 
1505 
1501 
1501 


921 
950 
938 
960 


2303 
2766 


| BOTH CDS (SR SHARED) 





2761 


I BOTH CDS (BOTH SHARED) 




2784 


1 







RTE-A/VC+ ON A700 


NON-CDS 

BOTH CDS 


:::::::.: 


1476 
1602 
1633 
1634 


1478 
1602 
1632 
1634 


2671 
2970 
3004 
3032 


2672 
2970 
3003 
3032 


2095 
2305 
2331 
2353 


1334 
1519 
1546 
1547 


997 
1024 
1024 
1052 


2434 
2799 


BOTH CDS (SR SHARED) 




2825 


BOTH CDS (BOTH SHARED) 




2849 






RTE-A/VC+ ON A900 


NON-CDS 

BOTH CDS 


605 
683 
700 
695 


607 
683 
700 
698 


1107 
1274 
1297 
1300 


1109 
1274 
1299 
1300 


911 
1011 
1024 
1036 


571 
656 
670 
667 


I 431 
I 446 
! 446 
! 454 

1 


1052 
1213 


BOTH CDS (SR SHARED) 


1227 


BOTH CDS (BOTH SHARED) 




1 1233 




1 



NOTE: 



SR already in memory when scheduled. 



Section 5 — RTE-A Partitioning 

The excellent capabilities of the A -Series computers and 
RTE-A have led to configuration of increasingly large 
systems. With this trend, some users have encountered 
a problem with limited space for system tables, which 
occupy the lowest 64k bytes of physical memory along 
with the RTE-A system itself. To make more table 
space available, RTE-A revision 2440 provides for the 
partitioning of drivers and the RTE-A system modules 
listed in Table 6 outside of the lowest 64k bytes of 
physical memory. When fully used, this partitioning 
makes 6 more pages of memory available for system 
tables than would be available in a non -partitioned 
system, 3 more pages than are available in the primary 
system. Memory is gained for table space in this way at 
the cost of the time required for jumping to and returning 
from the partitioned routines. These times, which include 
memory mapping changes, are listed in Table 7. 



Table 6. Partitionable RTE-A System Modules 



MODULE | ENTRY POINTS 


TYPE OF CALL 


CDSFH | $CDSF, $VMAF 
I SCDSS0, SCDSX 
I STOG.CDS 


Trap 

Jmp 

JsbO, ReturnO 


DSQ I SEX34 


Jmp 


LOAD | $COMP, $EX08, SEX28 

1 SDREL, $LDAB, SLOAD, SLULK, 
1 SPREL, $S0RL, $SR13, 
I $ TOG. LOAD 

1 SSREL 


Jmp 

JsbO, ReturnO 

Jsbl, Returnl 


MEMRY | SEX22 

I SGBLK, $GFRE, SGTSZ, SMFRE , 
I SSHAREPR, STOG.MEMY, SUNMK, 
1 SXE02, $XEQ5 


Jmp 

JsbO, ReturnO 


PERR | SDMPR 
I SPRTY 
I STOG.PERR 


Jmp 

Trap 

JsbO, ReturnO 


STAT | SDSRQ, $EX26, SEX33 , SPSRQ 


Jmp 


VEMA | SEMCK, SSETMC, SSETMU, 
I SSETVC 


JsbO, ReturnO 


XCMND | CKUSR 

| $ASRQ, SBRRQ, SCDRQ, SDNRQ, 
I $DTRQ, SGORQ, $PRRQ, SSSRQ, 
I SSZRQ, SULRQ, SVSRQ, SWSRQ 


JsbO, ReturnO 
Jmp 



Table 7. Driver -Operating System Partition Mapping 
(execution times in microseconds) 



I" 



OPERATION 



A600+ | A700 | A900 



DRIVER MAPPING 



j Map Driver 
Remap Driver 



25.4 



i r 

| 20.0 | 



24.5 | 21.2 
I 



5.4 
8.7 



MAPPING IN, AND RETURNING 
OPERATING SYSTEM MODULES 


FROM, 


PARTITIONED 






JSBO Mapping/Tag routine 


1 «s 


8 


42.2 | 


26 





JSB1 Mapping/Tag routine 


1 

I 65 


4 


1 
55.2 | 


32 


2 


ReturnO Remapping/Tag 
routine 


I 40 


9 


38.5 | 

1 


18 


1 


Returnl Remapping/Tag 
routine 


1 52 


3 


40.0 I 

1 


18 


9 


JMP Mapping/Tag routine 


I 26 


1 


1 1 
23.0 | 


14 





Trap Mapping/Tag routine 


I 64 

1 


3 


76.0 | 
1 1 


36 


2 



from an analysis of one thousand ALGOL 60 programs 
as an attempt to represent an average program instruc- 
tion mix. The algorithm includes many different opera- 
tions (floating point and integer calculations, transcen- 
dental functions, array manipulation, and conditional 
jumps). The universality of this algorithm has been 
substantiated by subsequent analysis of FORTRAN 
programs. It was designed to be non-optimizable so that 
it would test the CPU and not the compiler. The execu- 
tion of the Bl and BID programs can be expressed in 
terms of a somewhat arbitrary entity known as a 
Whetstone instruction. 

One loop through these programs represents 1,000,000 
Whetstone instructions. Since we loop through the pro- 
gram 10 times, the times for Bl and BID in Table 8 
represent the time to execute 1 0,000,000 Whetstone 
instructions. By dividing the total execution times (in 
seconds) into 1 0,000, these times can be expressed in 
thousands of Whetstone Instructions Per Second (KWIPS), 
as listed in Table 9. 



Section 6 — Whetstone Benchmarks 



Table 9. Whetstone Instructions Per Second (KWIPS) 



A subset of the Whetstone benchmarks is provided in 
Table 8 for a comparison of the A -Series and F-Series 
processors and other machines for which this bench- 
mark data is available. 



Bl 



BID 



1 

F-Series 


A600+ 


A700 


A900 1 


I 450 

1 


225 


490 


1042 | 


1 

I 245 


113 


340 


758 | 

1 



The Whetstone benchmarks were developed by the Na- 
tional Physical Laboratory in England. The Bl and BID 
benchmarks shown in Table 8 are referred to in the 
industry as the Whetstone single and double precision 
benchmarks, respecctively. The Bl and BID programs 
were written in FORTRAN using an algorithm derived 



Table 8. Whetstone Benchmarks 



BENCHMARK 


TIME (MINUTES) 


RTE-6/VM 


RTE-A W/VC+ 


F-Series 


A600+ 


A700 


A900 


Bl Single Precision Whetstone. Tests standard FORTRAN instruction 
set; not optimizable ; tests CPU and not compiler. 

BID Double Precision Whetstone. Double Precision version of Bl. 

B3 Tests compiler efficiency and Single Precision Floating Point 
by using Single Dimension Array Access. 

B4 Double Precision version of B3 . 

B5 Tests transcendentals by calling SQRT, SIN, COS, EXP, and ATAN. 

B6 Double Precision version of B5 with shorter loops. 

B9 Tests Floating Point using Binomial Expansion. 

B10 Tests compiler efficiency at analyzing IF statements. 

Bll Tests compiler accuracy using Double Precision Floating Point 
and transcendentals. 

SP2 Tests Single Precision Floating Point calculation speed. 

DP2 Tests Double Precision Floating Point calculation speed. 


0.37 

0.68 
0.88 

0.35 
0.40 
0.10 
1.08 
0.97 
0.97 

0.53 
0.77 


0.74 

1.48 
1.30 

0.75 
2.09 
0.23 
1.99 
0.99 
1.50 

1.17 
2.38 


0.34 

0.49 
0.82 

0.35 
0.27 
0.04 
0.97 
1.10 
0.60 

0.48 
0.66 


0.16 

0.22 
0.25 

0.11 
0.17 
0.02 
0.34 
0.32 
0.22 

0.17 
0.28 



Configuration of Systems Used 
for Performance Testing 

The tests made for this performance brief were run on 
systems configured as follows: 

Computers: 21 56B (A600+) w/0. 5 MB memory, 21 37A+ 
+001 (A700) w/floating point processor and 0. 5 MB 
memory, and 21 39A (A900) w/0. 75 MB memory 

Terminals: One 26 26 A Display Station connected to the 
system via a 12005B+005 interface as system console and 
one 2 6 26 A Display Station connected to the system via a 
12005B or 12040C interface, as appropriate, for the CPU ■ 
I/O extender performance comparison tests and other I/O 
tests. 

System Disc: 791 2P 28. 1 MB CS/80 Disc, connected to 
the system via a 12009A HP-IB interface. 

I/O Extender: 1202 5 A/B I/O Extender. 

I/O Interfaces: In addition to the 12005B interface used 
for connection of the system console and the 12009A 
interface to the system disc, a 1 200 5B, a 1 2009 A, a 
1 2006A, and a 1 2040C interface were used for I/O 
Extender tests. 



RTE-A Operating System: Revision 2440 RTE-A opera- 
ting system, installed as a standard primary system with 
minimal modifications, as follows: 



1 . Computer Type: 
Relocated: 



A600+ 
%RPL63 



A700 
%RPL73 



A900 
%RPL91 



2. For I/O tests, all drivers were relocated 
into the system to avoid incurring the 
overhead of mapping the drivers into a 
partition. 

3. Operating system modules %RTI0A and %I0M0D 
were relocated next to %EXEC to save base 
page links. 

4. For the partitioning tests, operating system 
modules %MEMRY and %L0AD were partitioned 

to provide a basis for measurement of the 
overhead involved in accessing partitioned 
RTE-A modules. 
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